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We investigate theoretically the physics of diffusion-limited intramolecular polymer reactions. The present 
work completes and goes beyond a previous study [Nat. Chem. 4, 268 (20f 2)] that showed that the distribu- 
tion of the polymer conformations at the very instant of reaction plays a key role in the cyclization kinetics, 
and takes explicitly into account the non-Markovian nature of the reactant motion. Here, we present in detail 
this non-Markovian theory, and compare it explicitly with existing Markovian theories and with numerical 
stochastic simulations. A large focus is made on the description of the non-equilibrium reactive conforma- 
tions, with both numerical and analytical tools. We show that the reactive conformations are elongated and 
are characterized by a spectrum with a slowly decreasing tail, implying that the monomers that neighbor the 
reactive monomers are significantly shifted at the instant of reaction. We complete the study by deriving 
explicit formulas for the reaction rates in the Markovian Wilemski-Fixman theory when the reactants are lo- 
cated in arbitrary positions in the chain. We also give a simple scaling argument to understand the existence 
of two regimes in the reaction time, that come from two possible behaviors of monomer motion which can be 
either diffusive or subdiffusive. 
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I. INTRODUCTION 

Determining how fast two reactants attached to a poly- 
mer come into contact is an old problem of statistical 
mechanics-^ . When a reactant molecule is attached to 
a polymer, its interaction with the whole polymer ch ain 
results in a complex motion that can be subdiffusivd^ 
and leads to non-trivial reaction kinetic^SESl xhe cy- 
clization reaction between the two end monomers is an 
important example of intramolecular react ion and h as 
been extensively studied, both theoreticalljffl^IlIIllinHiSl 
and experimentally, for example in the context of hairpin 
formation in nucleic acidi^^''^ or the folding of polype- 
tide chainJ^SHSSl Indeed, the cyclization of a polypeptide 
chain can be seen as an elementary step of the folding 
pathwajii^. In the context of nucleic acids, the forma- 
tion of loops and hairpins in DNA is a key process in the 
regulation of gene expressiorPS, while the cyclization of 
molecular beacons can be use d as a tool for the recogni- 
tion of nucleic acid sequence d^^ l ^^ l More generally, the 
cyclization time is of interest for example because it con- 
trols in part the formation of rings in the polymerization 
reaction^^H 

In this paper, we study the diffusion controlled reaction 
between two reactive groups that are attached to a single 
polymer chain. The theoretical description of polymer 
reaction kinetics is made complicated by the structural 
dynamics of the chain: the position of a single monomer 
cannot be described as a Markov process, because it re- 
sults from the interactions between all the monomers of 
the chain. Here we focus on the non-Markovian effects 
and we consider the simple case where the polymer is 
modeled by a Rouse chain of beads and springs where 
both hydrodynamic and excluded volume interactions are 
neglected. Despite its simplicity, this model catches some 



important aspects of polymer dynamicd^, and the cal- 
culation of the mean cyclization time is not trivial due to 
the presence of the non-Markovian effects that we aim at 
describing. Classical approaches on polymer reaction ki- 
netics rely on "Markovian appr oxim ations" , such as the 
harmonic spring approximatioiP'^ (where the polymer 
is approximated by a single spring and the problem is 
therefore Markovian), or the more refined clo sure ap- 
proximation in the Wilemski-Fixman theorjEH, which is 
a local equilibrium assumption for the whole polymer. 
Because memory effects appear at the largest relaxation 
time scale of the polymer, which can be of the same or- 
der of magnitude as the average cyclization time, Marko- 
vian approximations have inevitably a restricted range of 
validity^. Alternative theoretical approaches have been 
propo sed, su ch as the use of the renormalization group 
theorjEEnH which provides for long chains perturbative 
results for small values of 4 — d, with d the spatial di- 
mension. More recent approaches include a refinement 
of the Wilemski-Fixman theory that considers the cor- 
relations between the initial and the final state^^HHl^ an 
exact formal iterative solution to the cyclization prob- 
lem in one dimension^, or a derivation of the cyclization 
time in the limit of a very small reactive radius from first 
principle^2ll. 

In a recent worlP^l^ we proposed another approach of 
the problem, in which non-Markovian effects are explic- 
itly taken into account by determining the statistics of 
the non-equilibrium polymer conformations at the very 
instant of the reaction. This approach is referred to below 
as the non-Markovian theory. Examples of reactive con- 
formations obtained by this approach are shown on Fig. 
[T] where one observes that a polymer tends to be more 
elongated at the instant of cyclization than in an equilib- 
rium looping conformation. This elongated shape leads 
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FIG. 1. Examples of polymer conformations at the instant 
of first contact between the two ends of a Rouse chain. The 
conformations in (a) are obtained from Brownian dynamics 
simulations, while the conformations in (b) are the results of 
the non-Markovian theory that is described in this paper. In 
(c), we represented equilibrium looping conformations. The 
fact that the reactive conformations (a,b) tend to be more 
elongated than equilibrium looping conformations (c) leads 
to a faster reaction kinetics than predicted by the Wilemski- 
Fixman Markovian theory. The red thick vertical line rep- 
resents the end-to-end vector R. The number of monomers 
is A'' = 300 monomers and the capture radius is a = 4.33. 
Precise statistics of reactive conformations for the same pa- 
rameters are presented in Figs. [3j |4] [5] and [6] 



to reaction times that are faster than predicted by the 
Wilemski-Fixman Markovian approach. The main goal 
of the present paper is to complete this non-Markovian 
theory of polymer intramolecular reaction kinetics, and 
to present new results that are related to it. In partic- 
ular, we provide an analytical description of the average 
reactive conformation of the polymer, and we describe 
the non-Markovian theory in detail. We also review the 
existing approaches of cyclization in the context of our 
non-Markovian approach and of stochastic simulations. 
Establishing the validity range of the Markovian theories 
is important because they are frequently invoked in the 
analysis of experiments o n hairpin formation or the fold- 
ing of polypeptide chainii^li2IMl_ Finally, we also give 
explicit formulas that describe the effect of the position 
of the reactants in the chain for the reaction kinetics, and 
we show that the j-factor, defined as the equilibrium con- 
tact probability density between the reactant^^HM! ap_ 
pears naturally in the expression of the reaction time 
but is not the only determinant of diffusion controlled 
reaction kinetics. We describe in the framework of the 



Wilemski-Fixman approximation how the reaction time 
is slowed when the reactive monomers belong to the in- 
terior of the chain. The present work completes another 
paper dealing with intermolecular reactionJ^. 

The outline of this paper is at follows. In section [llj 
we introduce the formalism of the Rouse chain, and we 
give a simple argument that enables us to derive a scaling 
relation for the reaction time. Then, in section |III[ we 
present in detail a non-Markovian theory of intramolec- 
ular polymer reaction kinetics. In section [TV] we discuss 
the validity of the hypotheses of existing Markovian the- 
ories in the context of our non-Markovian theory. In sec- 
tion [Vj we determine the accuracy of the non-Markovian 
theory and of existing theories by confronting them to 
the results of stochastic simulations. In section |VI[ we 
study in details the reactive shape of the polymer both 
with our non-Markovian theory and with simulations. Fi- 
nally, in section |VIH we investigate the role of the posi- 
tion of the reactive monomers. This work is completed 
by appendices, where we remind some useful properties 
of Gaussian processes and we give details of calculations. 

II. DEFINITION OF THE ROUSE CHAIN AND SIMPLE 
SCALING EXPRESSION FOR THE REACTION TIME 

We consider the Rouse model of a polymer chain evolv- 
ing in a three dimensional (3D) space. The chain is 
formed by N monomers located at positions (1 < J < 
N), in which quantities in bold denote vectors in 3D. 
The monomers are connected by linear springs of stiff- 
ness k. Each monomer is submitted to a friction force 
(with C the drag coefficient) and diffuses with a diffusion 
coefficient D — ksT/^, where T is the temperature. The 
length Iq — ^yksT/k is the typical length of a bond, while 
To — C/k is the typical relaxation time of an individual 
bond. In the present paper, the microscopic length Iq and 
the microscopic time tq are set to 1, which fixes the units 
of length and time. We are interested in the kinetics of a 
reaction that occurs between two reactive monomers of 
the same chain, that have the indexes p and q. We de- 
fine the vector R that joins the positions of the reactive 
monomers: 

R=r, -Tp. (1) 

In the diffusion controlled regime, the kinetics of the re- 
action between the monomers p and q is quantified by 
the mean time T it takes for the distance ||R|| to reach 
a value that is smaller than a typical reaction radius a. 
Of course, the reaction time depends on the choice of ini- 
tial conditions of the chain. We call T the reaction time. 
Here, we will focus on the case where the polymer is ini- 
tially at thermal equilibrium, conditional to the fact that 
the vector R is outside the reactive region (||R|| > a). 
Note that, although the Rouse model is highly simplified 
because it neglects both hydrodynamic interactions and 
excluded volume interactions, it catches some important 
aspects of polymer dynamics^ ^, and the calculation of 
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intramolecular reaction times is non- triviaP. The fact of 
studying the Rouse model enables us to focus on the non- 
Mar kovian effects, but our theory could be generalized to 
more complex polymer models. 

Before exposing the theoretical determination of the 
reaction time, we remind some characteristics of the dy- 
namics of R(t) at different time scales that will be use- 
ful in the non-Markovian theory. The dynamics of R(t) 
can be characterized by considering the evolution of the 
monomer positions in terms of independent Rouse modes. 
The definition of the Rouse modes a, fo llow s from the 
diagonalization of the connectivity matri>!pl3II^ and these 
modes are related to the positions by a linear relation: 



N 



where the transfer matrix Q.^a reads: 



Q^ 



3lj 



N 



cos[(z-l/2)(j-l)ViV] 



(2) 



(3) 



A polymer configuration can therefore be described ei- 
ther as a set of positions |r) — (ri,...rjv), or as a set 
of the values of the Rouse modes |a) = (ai, a^r). In 
this paper, we adopt the convention that \u) represents 
a TV-components column vector. The modes a^ evolve 
according to the Fokker-Planck equation: 



dP 

~dt 
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(4) 



where the relaxation times l/A^ are the inverses of the 
eigenvalues of the connectivity matrix, which are given 
for 1 < z < by the relation: 



A, = 2{1 -cos[(i- l)7r/A^]}. 



(5) 



The first eigenvalue is Ai — 0: the first mode ai is pro- 
portional to the polymer center-of-mass position, which 
has a diffusive motion. We call tji — X^^ ~ N'^ jw^ the 
largest relaxation time of the polymer. The second eigen- 
mode a2 is associated to this time scale, and describes 
the shape of the polymer at the typical length scale V^- 
The other modes a^ describe the shape of the polymer 
at intermediate length scales between \/ N (the size of 
the polymer) and 1 (the size of a single bond). When 
N ^ oo, the Rouse modes are simply proportional to 
the Fourier coefficients of the function r(s), where s is 
the curvilinear coordinate of a monomer in the chain. 

From ([T]) , ([2]) , we observe that we can express the vector 
R as a linear combination of the Rouse modes, which 
leads to the definition of coefficients bi by the relation: 



R 



N 



Qqi Q 



pi • 



(6) 



Importantly, the first coefficient vanishes (&i = 0), which 
means that the dynamics of the vector R is indepen- 
dent on the position of the polymer center-of-mass. At 



long times, the process R(i) reaches a stationary state. 
We call L the equilibrium distance between the reactive 
monomers, so that is the variance of each spatial coor- 
dinate of R at equilibrium (the variance of R^ at equilib- 
rium is iL^). From we note that the variance of each 
coordinate of a.^ at large times is 1/Ai, and we deduce the 
value of the equilibrium length L: 



N 



(7) 



where the last equality follows from the use of the explicit 
expression (|6]) for the coefficients 6^. Equation ([7| simply 
states that the equilibrium distance between monomers 
of indexes p and q scales as \/\p — q\, as expected from 
the central limit theorem. 

The stochastic process R(t) is Gaussian, its dynam- 
ics can therefore be characterized by the evolution of its 
average and variance. Let us consider initial conditions 
where the polymer is at equilibrium, with the supple- 
mentary condition that R = Rq. Then, the average of 
R(i) at a later time is given by (R(i)) = 0(t)Ro: the 
function 0(t) describes how the average of R(i) relaxes 
to its equilibrium position. This function is such that 
(/)(0) — 1, and (t>{t) vanishes at large times. The function 
(j) is given by the following expression, whose derivation 
is reminded in Appendix [B{ 



N 



-Xit 



A,L2 



(8) 



We also define a function tp{t), that represents the vari- 
ance of each coordinate of R(i), given that initially the 
value of R is Rq and that the polymer is at equilibrium: 



(9) 



From the expressions (|8|,([9]), we deduce the behavior of 
the mean square displacement function ^pit) &t different 
time scales when N ^ 1: 

Ut (< < 1 ; Ai? < 1) 
■4>{t) ~ <j AtVi (1< i < A2 ; 1< Ai? < Vn) . (10) 
L2 (t > ; Ai? ~ VN) 



In Eq. (10), the behaviors arc distinguished for the dif- 
ferent time scales t. The limiting behaviors can also be 
discussed for the different length scales Ai? ~ tp^^^ as 
well. We remind that the length Iq and the relaxation 
time To have been chosen as units of length and time. 



According to (10 1, the motion is diffusive at short times. 



where the reactive monomers behave as if they were dis- 
connected from the rest of the chain. At intermediate 
time scales, ip ^ n t^/"^, the motion is subdiffusive and 
results from the interactions with all the monomers of 
the chain. The coefficient k does not depend on N , 
but depends on the position of the reactive monomer 
in the chain (k = S/^/n if the reactive monomers are at 
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the chain ends, whereas k = i/y/iT for interior reactive 
monomers, see Appendix [b|) . At long times, ip is con- 
stant, and the process R reaches the stationary state. 
From the behavior ( 10 1 of the mean square displace- 



ment tp at different time scales, we can derive a simple 
scaling law for the reaction time by using the system- 
atic procedure introduced in a recent paper dealing with 
intermolecular reactions^i-. Note that alternative quali- 
tative reasonings can be found in other referenceJI^EIl 
The fact that ip L'^ for large times indicates that the 
parameter L"^ ~ N^/^ plays the role of an effective con- 
fining volume. Let us assume that the size of the re- 
active region is small compared to the length of a sin- 
gle bond (a <C 1), and that initially the polymer is at 
equilibrium. The diffusion controlled reaction occurs in 
two sub-steps that involve the properties of the stochas- 
tic process R(i) at two different length scales. The first 
step of the reaction consists in reaching for the first time 
a distance of order 1 between the reactants. According 
to (10), at these large length scales, R is subdiffusive: 
tp ~~7R(i)^) ~ t^^^ = t^/''™, defining a walk dimension 
— 4. A Markovian walker that has a walk dimen- 
sion of dw and starts in a random position in a confining 
volume of radius L would reach a punctual target in a 
tim^^sHUl y ~ L"^^' . Assuming that this scaling argu- 
ment holds for our non-Markovian problem, we deduce 
that the average time needed to complete the first step 
of the reaction scales as T ~ L'^™ ^ N'^ . 

Once the length 1^ = 1 has been reached for the first 
time, there is a second step in the reaction, that consists 



in reaching the sphere of radius a. From ( 10 1, we observe 



that, at small length scales, R(i) behaves as a diffusive 
process with diffusion coefficient of oder 1. Assuming 
that the reaction time is the same than for a diffusive 
Markovian walker'^^ in a confining volume V — Li^ with 
an initial position that is far from the reactive site, we 
get the estimate for the time of this reaction substep: 
T ~ l? ja. Adding the two times corresponding to the 
two substeps, we obtain the total reaction time: 



T ' 



iV3/2 



(11) 



From (11), it is clear that the dominant term for the 
reaction time is when N is large and comes from the 
subdiffusive reaction substep. However, the first term of 
(11) becomes important when the size a of the reactive 
region is smaller than N~^/'^. 



The presence of the regimes ap pearing in Eq. (11 ) for 
the reaction time is already knowrP^^IMM]^ ^j^g garliest 
discussion was done by Doi'^*, who noted that the scal- 
ing T ~ N'^ , that is predicted by the Wilemski-Fixman 
theory^ ^ for large N , appears only when the interactions 
between the monomers are considered. Indeed, the fact 
of replacing the whole polymer by a single spring leads 
to the different law T ~ N^/'^a~^ for the reaction tim^^H^ 
a result which was recovered with another approach of 
the harmonic spring model in the SSS theory of Szabo, 
Schulten and Schulten^. Importantly, as noted in a pre- 



vious worli^ and reminded below, the Wilemski-Fixman 
treatment of the full problem predicts both behaviors 
N"^ and N'^/^a^^ that appear in the large N and small a 
limits, respectively. The behavior N^^'^a~^ has also been 
recently derived from first principlefl^, while the scaling 
relation T ^ N"^ appears in the treatment of the problem 
by the renormalization group theorjI^S!. The presence of 



the two regimes of ( 11 ) has been checked with numerical 
simulations only recently^Sj. At this stage, we have used 
a simple scaling argument to derive the scaling relation 
( 11 ), where the apparition of two regimes is linked to the 



presence of two substeps where the motion of the reac- 
tant is qualitatively different, corresponding to the differ- 



ent regimes appearing in Eq. (10 1. Note that the scaling 
relation (11) is not sufficient to characterize the reaction 
time, as it does not describe its behavior for finite values 
of N and a and it does not permit the identification of 
the numerical coefficients. In the next sections, we de- 
scribe a non-Markovian theory that enables the precise 
derivation of the reaction time. We also discuss the va- 
lidity of the hypotheses made in the existing theories in 
the context of our more general theory. 



III. NON-MARKOVIAN THEORY FOR THE KINETICS 
OF INTRAMOLECULAR REACTIONS 

We now derive the equations of the non-Markovian the- 
ory of intramolecular reaction kinetics that was briefly in- 
troduced in a previous worlP^l. As these equations share 
a lot of similarities with the equations for intramolecu- 
lar reactions that are described in details in Ref.'^iJ, we 
refer to this reference and to Appendix [C] for calcula- 
tion details. The starting point of our analysis is to con- 
sider the stochastic process formed by the positions of 
all the monomers Vi{t) (or, equivalently, of all the modes 
SLi). This process, to the difference of the single process 
R(t), is Markovian, which enables us to use the renewal 
theorjPni. Let us temporarily consider the case where 
the polymer does not react when it reaches the reactive 
zone. We define an arbitrary position Rj that is inside 
the reactive zone (i.e. with |jR/|| < a), and we consider 
a configuration |a) such that R = R/. If this configu- 
ration is observed at it means that the polymer must 
have crossed the reactive boundary at some earlier time 
i', with a configuration |a'). Therefore, defining /(|a'), t') 
as the probability density of reacting for the first time at 
t' with a configuration |a'), we can write the following 
renewal equatiorP^ 



P(|a),t|{ini})= j dt' j dW)f{\B!),t')P{\^),t\\B!),t'). 

(12) 

In this equation, dja) = dsLi...dsLtq , and P{\a),t\ |a'),t') 
is the probability of observing the configuration |a) at t 
given that the configuration |a') was observed at time t' . 
Similarly, P(|a), t|{ini}) is the probability of observing 
the configuration |a) at i given an initial distribution of 
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conformations at t = 0. The initial distribution of modes 
is assumed to be an equilibrium distribution, given that 
the initial distance between the reactants is larger than 
the reactive radius (Rq > a). Let us introduce Q, which is 
a shortcut for both the polar angle and the azimuthal 
angle tp, with dfl — sm 9 d9 dip /At:, and u,.(r2) the unit 
radial vector pointing outwards the unit sphere in the 
direction defined by {9,ip). Then, the initial distribution 
of conformations can be written as: 



^ini(la)) = 
J a 



Z{a,L^) 



dnP,te.t{\a)\RoUr{n)), (13) 



where Pstatda) |R-i) represents the equilibrium probabil- 
ity density of conformations given that R = Ri, and 
where the normalization factor Z is defined by: 



Z{a, h) 



dRo Rq e 



-Rl/(2h) 



(14) 



Taking the Laplace transform of Eq. (12), and develop- 



ing for small values of the Laplace variable leads to an 
equation that involves the mean first passage time: 

TPstatda)) = 

poo n 

J dtj dn[p{\a),t\TTn,Q)~P{\a),t\Pini,n,0)\. (15) 



In Eq. (15), we have introduced the splitting probabil- 



ity TTfj ( I a) ) , which represents the distribution of config- 
urations I a) at the very instant of reaction, given that 
R = a Ur(f2) at the instant of reaction. Similarly, 
-Pmi,o(|a)) represents the initial distribution of modes 
given that R is in the direction Ur(ri). The quantity 
P(|a), i|7rn, 0) represents the probability density of ob- 
serving I a) at t given that the initial distribution of the 
modes was ttq. Equation (15) is an exact integral equa- 



tion that defines both the reaction time T and the split- 
ting probability distribution ttq. Integrating it over the 
configurations that are such that R 
following exact expression of the reaction time 



Rf leads to the 



rp,tat(R/) = 

dt J dfl 
1 



P(R/,i|7rn,0) 

dRoRle'"^ P{Iif ,t\{RQVir,stdX},Q) 



(16) 



in which P(Ry , t|7rn, 0) represents the probability density 
of observing R in position Ry, given the initial distribu- 
tion TTQ for the conformations at time t = Q. Equation 



( 16 ) shows that the reaction time is inversely propor- 
tional to the factor Pstat(R/)7 which is the equilibrium 
probability density to find the two reactive monom ers in 
contact, and is sometimes referred to as the j-iacto^^^. 



It plays the same role as the inverse of a confinement vol- 
ume in the case of intermolecular reaction^^, and it can 
be written explicitly: 



1 



(27rL2)3/2 



exp 



R^\ 
2i2 ) 



(17) 



Equation ( 16 ) is an exact expression of the reaction time. 



However, it cannot be used without knowing the splitting 
probability distribution ttq. 

As in the case of intermolecular reaction^^, the key hy- 
pothesis of the non-Markovian theory is to assume that 
the splitting distribution is a multivariate Gaussian 
distribution: only the values of the average and the co- 
variance of the modes a^ over ttq have to be determined. 
This Gaussian approximation of ttq considerably simpli- 
fies the problem. Instead of having to find a function 
7rn(ai, ...,3.^) of 3iV variables that is solution of the in- 
tegral equation (15), one is left to finding a finite set of 



unknown quantities, that are the first and second mo- 
ments of ttq. 

The first moments of ttq are the average value of the 
modes a, at the instant of the reaction, given that the 
reaction takes place in the direction and are noted . 
For symmetry reasons, is oriented in the radial direc- 
tion defined by the unit vector Ur(n), and we call mf its 
component in this direction: mj^ = m^u^. At a time t 
after the reaction, the average value of a^ in this direction 
is simply m^e~^'*. Summing over the coefficients 6i, we 
deduce the value of R-^it), defined as the average value 
of R in the radial direction Ur(r2) at a time t after the 
reaction, which reads: 



N 



RAt)^2^b, 

2 



-Ait 



(18) 



We will derive below a set of equations that define the 
moments in a self-consistent way. In principle, in the 
non-Markovian theory, one should also determine the co- 
variance matrix of ttq. Here, for simplicity, we assume 
that the covariance matrix of ttq is well approximated 
by the covariance matrix that characterizes the equilib- 
rium conformations that make a loop. We call this ap- 
proximation the "stationary covariance approximation" , 
which was fo und to be accurate in the case of intermolec- 
ular reaction^^Hm deduce the value of the equilib- 
rium covariance matrix from the formulas on conditional 
Gaussian probability distributions that are mentioned in 
Appendix \K\ and Appendix [B] 



bib. 



(19) 



where a and /3 represent the spatial coordinates x,y,z. 
Note that, although the modes a^ are independent at 
equilibrium, the fact of conditioning them to a particu- 
lar value of the vector R = ^i^i introduces correlations 
between them. In the stationary covariance approxima- 
tion, the moments mj^, together with the reaction time 
T, are the only unknown variables of the theory. 
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Under the stationary covariance approximation, we 
can write an expUcit form of the reaction time (16 1: 



_ rI [Rj:-Ko,»ur(n)l^ 



(20) 



where the functions ((> and ^ describe the relaxation dy- 
namics towards equilibrium, and have been introduced 
previously in Eqs. ([8|,(|9]). The integral over the an- 
gles n ~ d,(j) in ( 20 ) can be performed by noting that 
(R/ - R^Ur)^ = Rj + Rl - 2RfR^ cos 6*, but the result- 
ing expression is however not simple. At this stage, we 
note that, by construction, the theory should predict the 
same value of T whatever the choice of the final position 
R/. In particular, with R^ = 0, we obtain a simpler 
expression for the reaction time, which writes: 



(21) 



where we remind that the function Z is defined in Eq. 
(14). This important expression is the simplest expres- 
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f°° dt 










Jo ^'/^ 


exp 1^ 




Z(a,L2) 



sion of the reaction time in the non-Markovian theory. 

At this stage, we need to write a set of equations that 
enable the calculation of the moments . This set of 
equations is found by multiplying the general integral 
equation ( 15 ) by the factor ai(5(R/ — J2j ^^'^ inte- 

grating over all the modes (see Appendix [C|) . Because of 
the presence of the S function, the resulting terms involve 
the conditional average /i^'*^, which represents the aver- 
age value of the i^^ mode at a time t after the reaction, 
given that the vector R has a value R = at the same 
time t. The expression of can be deduced from the 
formulas on conditional Gaussian distributions that are 
given in Appendix lAj 



ir,0 TT — A.t 



R^{t)h[l - (l){t)e- 



(22) 



After the multiplication of the integral equation ( 15 ) by 
the factor ai(5(R/ — X^j^j^j); ^'^'^ integrating over all 
the modes leads to to the following set of self-consistent 
equations, that are derived in detail in Appendix [C| 




Z(a,L2)A 



0, (23) 



where we have defined the function G by the following 
relation: 



/"OO 

G(a, h) = dRoR^ exp 

J a 



Rq 
2h 



(24) 



As already mentioned, the conditional average lJ-J'^{t) 
plays a key role in the self-consistent equations that de- 
fine m^. Other terms come from the results of the aver- 
age over angles fi and initial distance Rq. The relation 
(23) is valid for 2 < z < and actually forms a set of 
A'^ — 1 equations that entirely define the moments mf ; it 
is the key equation of the non-Markovian theory as it en- 
ables to define in a self-consistent way the average value 
of the polymer conformation at the instant of the reac- 
tion. In fact, there are only A^ — 2 equations, because R is 
known to be equal to a with probability one over the 
distribution ttq, which implies the relation bj-m^ — a 
that is compatible with ( 23 ) . Solving the non-Markovian 
theory consists in solving the set of equations ([23]) to ob- 
tain the average value of the modes at the reaction, 
which can be done numerically or analytically in some 
limiting cases (see below). Then, the result is inserted 
into the expression of the reaction time (21). We stress 



that, until now, we did not make any hypothesis on the 
actual location p and q of the reactive monomers, which 
enter only in the definition ^ of the coefficients bi and 
therefore in the functions (p and ip. 



IV. RELATION OF THE NON-MARKOVIAN THEORY 
TO EXISTING MARKOVIAN THEORIES OF 
CYCLIZATION 

Now, we show how introducing supplementary approx- 
imations in the non-Markovian theory enables to recover 
some existing theories. All the theories described in this 
section will be confronted to simulations in section |Vl 
For simplicity, we consider the case where the reactive 
monomers are at the polymer extremities (p — 1 and 
q^N). 

The first theory that we consider is the Wilemski- 
Fixman theorjW^J^ which turns out to appear naturally 
in our equations by making a local equilibrium assump- 
tion. A Markovian approximation consists in assuming 
that the distribution of conformations at the instant of 
reaction is an equilibrium distribution, conditional to the 
restriction that R = a Ur(ri), where Ur(ri) is the unit 
vector defining the direction in which the reaction takes 
place. Formally, this approximation can be written as: 



7ro(|a)) ~ Pstat(|a)|a Ur(0)). 



(25) 



In our theory, we already assumed that the covariance 
matrix of ttq is given by the stationary covariance matrix, 
so that the approximation ( 25 ) consists in assuming that 



the moments m^, instead of satisfying the self-consistent 
equations ( 23 ) , are equal to the average value of the 
modes at equilibrium, given the condition R = au^. Ap- 
plying a formula on conditional Gaussian distributions 
that is reminded in Appendix [Aj we find that this ap- 
proximation leads to the following expression for the the 
average value of the modes rnf : 

^ — . (26) 
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We can gain insight in the meaning of this formula by the reaction time as a form of a convergent integral: 
considering {zijT^ = '^^Qij'm'^, which is the average 



value of the position of the i monomer in the radial 



direction at the instant of reaction. Approximation ( 26 ) 
implies that {zi),^ is linear in the part of the polymer 
between the reactive groups {p < i < q), whereas it is 
constant at the exterior. 



Inserting ( p6| into ( 18 1 and comparing with ^ implies 
that the reactive trajectory in the Markovian approxima- 
tion is simply i?7r(0 — Then, evaluating the gen- 
eral formula ( 16 ) with these approximations and taking 
the value Rf — a, we obtain the following expression for 
the reaction time in the Markovian approximation: 



T: 



dt 



a20(l - (j)' 



2)1/2 



sinh 



(27) 



In fact, the expression ( 27 ) is valid only when a L, oth- 
erwise the term "—1" in the second part of the integrand 
would be replaced by a more complicated term (strictly 
speaking, ( 27 1 gives the mean first passage time to the re- 



active sphere, but with an initial distribution that is an 
equilibrium distribution which can be inside or outside 



the reactive region). The expression (27) is the reaction 



time obtained by using a Wilemski-Fixman approach and 
choosing a delta sink functiorP, which is the most accu- 
rate choice of sink function to describe reactions in the 
diffusion controlled regim^. This result clearly shows 
that the Wilemski-Fixman approach is equivalent to the 
assumption that the reactive conformations of the poly- 
mer can be replaced by equilibrium looping conforma- 
tions. Our theory also provides an alternative formula 



for the reaction time, that is simpler than (27), and that 
is obtained by calculating ( 16 ) with the vali 



ue Rf = 0: 



t Mark. 



dt 



i,{t)3/^ 



-a>V(2^) 



Z{a,L^) 



(28) 



The two formulas ( 27 ) , ( 28 ) are two legitimate expressions 



of the reaction time in the Markovian approximation. 

Let us briefly derive the scaling relations predicted by 
the Markovian theory from Eq. (28) in the case of the 



cyclization reaction. First, let us consider the limit of a 
large number of monomers, at a fixed value of a. In this 
limit, the function tends to a limiting function $(t) 
that depends on the rescaled time r = t/N'^, and that is 
deduced from (|3]) , ([6]) , ([s]) : 



dr 



1 

[l-$(r)2 ]3/2 



- 1 ^ (TV -> oo). (30) 



Evaluating numerically this integral leads to the formula 
T ~ 0.201 N'^: we recover the fact that, in the Wilemski- 
Fixman theory, the mean cyclization time is proportional 
to iV2 for large N at fixed a. It is however interesting 
to note that the limit of small reactive region leads to 
another scaling relation. In the limit a — > at fixed 
N, the expression (28 ) can be evaluated by replacing the 
integrand by its short time limit (in which ^ ~ 1 and 
~ 4i): 



T: 



g-aV(8t) ^{N- 1)3/2 

dt ,. = ^ — ^— '- — (a -> 0). 



(4 <)3/2 



VS a 



(31) 



Therefore, the Markovian theory contains the two scal- 
ings T ~ N'^/'^/a and T ^ N'^, a. fact that had already 
been noted by Pastor et al.^. The two scalings (30), (31) 



correspond exactly to the general scaling relation (11), 



that was derived by assuming that the reaction occurs in 
two substeps, that involve the subdiffusive and diffusive 
behavior of the end-to-end vector at large and interme- 
diate time scales, respectively. 



It turns out that the formula (30) can be linked with 



the results of the renormalization group theory of cy- 
clization, that are valid when the value of the spatial 
dimension d is close to c? = 4. A simple generalization of 



(30) to the case of a d-dimensional space leads to: 



T = N^ 



dr 



[1 - $(T)2]'i/2 



- 1 



(32) 



This integral converges only for d < 4. If we introduce a 
fixed (small) time ti, we see that in the limit 4 — d — >■ 0, 
(32) is approximately equal to: 



''^[8/^/7FTl/2][2-(4-d)/2]' (33) 



where we neglect supplementary terms that do not di- 
verge as d — >■ 4. By performing the integral (33), we find 
that the result at smallest order in the parameter 4 — d 



T: 



TT iV2 



16(4- d) 



RG 



(d^4,iV^oo). (34) 



$(r) = lim <j){TN') = , 

q odd 



E 



(29) 



Due to the infinite number of terms in ( |29[ ), the func- 
tion $ has an anomalous behavior for small values of the 
rescaled time: $(t) ~ 1 — 4(T/7r)^/2^ which reflects the 
subdiffusive behavior of the end-to-end vector. Due to 



this anomalous behavior, we can rescale ( 28 ) and express 



The expression (|34j) is exactly the scaling relation ob- 
tained in the renormalization group theorj^^. Interest- 
ingly, setting c? = 3, we find that the the numerical coef- 
ficient of the scaling relation (34) is 7r/16 = 0.196. This 
value differs by only 2.4% from the value of 0.201 that 
was estimated by the Markovian approximation. The 
fact that this result of the renormalization group can be 
derived by developing the reaction time obtained from 
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the Markovian theory suggests that there is a local equi- 
librium assumption that is made in the renormalization 
group theory. 

Finally, we discuss the evaluation of the mean first pas- 
sage time in the simplest theory of cyclization: the har- 
monic spring approximation, which consists in replacing 
the whole polymer chain by a single spring that has an 
effective stiffness k^s- Due to the fact that there are iV— 1 
bonds in series, this effective stiffness is fceff = k/ {N — 1). 
Then, one assigns a single effective drag coefficient (cS 
(which is to be determined later) to the end-to-end vec- 
tor R. With these assumptions, the process R is now 
characterized by a single time scale TeS = Ces/keS- It 
is therefore Markovian and its first passage properties 
can therefore be computed analytically, as was done by 
Szabo, Schulten and Schulten* who determined the re- 
action time by directly solving the adjoint equation and 
who obtained: 



sss 



TcS 

L^Z{a,L^) 



dx 



dy y e" 



(35) 



An alternative method to solve the harmonic spring prob- 
lem is to use the Renewal theory, whose result is di- 
rectly given by equation (2f I, in which (j) must be re- 



placed by its value for an Ornstein-Uhlenbeck process 
{4> = exp(— t/Tcff)). Of course, these two methods lead to 
the same results, as they both are an exact treatment of 
the harmonic spring model. If we take the limit a — >■ or 
iV — >■ oo in the SSS expression (35 1, we obtain at leading 
order: 



Tsss 



'I a 



(36) 



In the harmonic spring approximation, the choice of 
the effective drag coefficient (cS is quite arbitrary. A 
natural choice is to set Ccff = C/2, which corresponds 
to the approximation that the two end-monomers be- 
have as if they were disconnected from the rest of the 
chain. With this choice, the scaling relation (36) coin- 



cides with the Markovian scaling law (31) obtained for 
small a. However in this case, the effective relaxation 
time Tcff = C(iV— l)/(2fc) is very different from the largest 
relaxation time of the polymer tji = A^^C/(7r^fc), which 
is one of the reasons why the SSS theory is not expected 
to give an accurate estimation of the reaction time. 

In the next section, we compare all these different the- 
ories with the results of numerical stochastic simulations. 



V. COMPARISON OF THE DIFFERENT THEORIES OF 
CYCLIZATION WITH NUMERICAL SIMULATIONS 

In order to test the validity of the different theories, we 
compare it to the results of numerical simulations. For 
simplicity, we restrict ourselves to the case of the end- 
to-end cyclization reaction, where the reactive monomers 




10^ 



10' 



N 



FIG. 2. Average end-to-end cyclization time in simulations 
(symbols) and different theories (lines). There are 2 sets of 
curves, corresponding to 2 values of a (upper blue curves and 
circles: a = O.ly/S, lower red curves and squares: a — 2.5\/3)- 
In each set of curves, the continuous lines are not a fit but 
represent the results of the non-Markovian theory. The up- 
per dashed Hues represent the results of the Wilemski-Fixman 
Markovian theory evaluated with equation 1 28 1, while the 



lower dash-dot lines represent the results of the SSS theory 
evaluated with equation ( 35 1. For the smallest value of a, the 



non-Markovian and the Markovian results are undistinguish- 
able on this figure. 



are located at the extremities of the chain {p = 1,9 = N). 
For small values of a, we used directly the results of Brow- 
nian dynamic simulation with adaptative time step per- 
formed by Pastor et alW. We carried out supplementary 
simulations for larger values of a and N with the same al- 
gorithm and a smaller value of the time step. Apart from 
exploring supplementary ranges of parameters, these sim- 
ulations enable us to analyze the statistics of the poly- 
mer reactive conformations, which has to our knowledge 
not been done before. In brief, the simulations use a 
Brownian dynamics algorithm and consist in generating 
stochastic trajectories that integrate the Langevin equa- 
tion which corresponds to the Fokker-Planck equation 
([4]). The fact that the time step is reduced when ||R|| — a 
becomes small is useful to increase the numerical preci- 
sion. 



The results for the reaction time are presented on Fig. 
[2] and table |l] where the simulation results are compared 
to the various theories. As can be observed, the non- 
Markovian theory accurately predicts the values of the 
reaction time for all the values of and a. The non- 
Markovian result is almost always within the statistical 
error of the simulations, although the reaction times are 
estimated over large numbers of realizations which range 
from 2, 500 to 20, 000. In comparison, the results of the 
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TABLE I. Values of the end-to-end cyclization time determined by stochastic simulations and different theories. Tsimuiation 
is the average first cyclization time in stochastic simulations, with ST giving the 95% confidence interval resulting from the 
error due to the finite number of simulation runs. TNon-Markovian i s th e result of the non-Markovian theory pres ented in this 
paper, TMarkovian is the result of the Markovian (Wilemski-FixmarP^ approxir nati on and is evaluated with (281. Tsss is the 
result of the SSS harmonic spring approximatiorP and is given by the formula (351. Trg is the result of the renormalization 
group approach and is given by formula ( |34[ |. Last, Takh is the result of Amitai, Kupka and HolcmaiPl and is given by 
Takh ~ s/t^/SN^^^ /a + SAsN^, with A3 ~ 0.05. Trg and Takh are only asymptotically valid for small A'^ and a, respectively, 
and are shown here only for the sake of completion. The simulation results for a = 0.87 are taken from Pastor et alW. Note 
that a is in units of /o and that all the times are expressed in units of (^/k — tq. 



a 


N 


^simulation i (5T 


Tnoii- Markovian 


^Markovian 


Tsss 


2rg 


Takh 


0.87 


50 


522 ± 30 


543 


608 


247 


491 


630 




100 


2040 ± 60 


2096 


2340 


707 


1962 


2220 


4.33 


50 


128 ±2.5 


123 


246 


37.6 


491 


426 




100 


642 ± 10 


599 


1217 


119 


1962 


1645 




300 


7439 ± 350 


6933 


13583 


1071 


17660 


14251 



Markovian approximation are much less accurate, and 
the error made is roughly 100% when the reactive radius 
a is not too small. For small values of a the Marko- 
vian approximation is hovifever excellent, we will see be- 
low that the reason for this is that Markovian and non- 
Markovian theories predict the same asymptotic form of 
the reaction time T in the limit of small a. The results 
of the harmonic spring approximation are less accurate 
than those of the Markovian theory, and can even differ 
by a factor of 7 from the cyclization time for the larger 
value of N. As mentioned above, the SSS theory predicts 
only the scaling relation T ^ N^/"^ /a for large N , con- 
trarily to what is predicted in more precise theories and 
simulations. We also included in tableUthe results of the 
renormalization group theory calculated with Eq. (34), 



which does not predict correctly the values of the reaction 
time for a = 4.33. Of course, the result of the renormal- 
ization group theory cannot be accurate in all regimes, as 
it is only asymptotically valid in the limit of large TV. To 
be complete, we also included in table|l]the recent results 
of Amitai et al^^, who found that in the limit of small a, 
the reaction time is given by T ~ ^JV/^N'^/'^ / a + iA:iN'^ , 
where A^ ~ 0.04 — 0.055 is a numerical coefficient ob- 
tained by fitting the data of numerical simulations. As 
can be seen in table |lj this formula is not accurate for 
the larger values of a, which is not surprising since it is 
supposed to be valid only in the limit of very small val- 
ues of a (a <C 1/VN). In conclusion, the non-Markovian 
theory presented in this paper appears to be very accu- 
rate for all values of the parameters that we tried, to the 
difference of the existing theories that all have a limited 
range of validity. 



VL THE REACTIVE SHAPE OF THE POLYMER AT 
CYCLIZATION AND THE ASYMPTOTIC FORM OF THE 
REACTION TIME ON THE NON-MARKOVIAN THEORY 



equilibrium reactive polymer conformations. We now fo- 
cus on the description of these reactive conformations in 
the non-Markovian theory and in the simulations, and 
we determine whether or not the hypotheses of the non- 
Markovian theory are reasonable. We study in details the 
case N = 300 and a — 4.33, for which 2,440 cyclization 
events have been recorded, and where the non-Markovian 
effects are important since the Markovian approximation 
overestimates the reaction time by a factor of 2. The 
statistics of the reactive conformations is presented for 
these parameters in Figs. [3) |4j [5j [6] and completes Fig. 
[1] where examples of reactive conformations in 3D are 
presented. 
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100 
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FIG. 3. Superposition of histograms of the modes in all 
spatial directions, for N = 300 and a — 4.33. For each value 
of i (2 < i < TV), the values of at at the instant of first cy- 
clization are recorded in the simulation and rescaled by their 
average mean and variance over 2440 realisations. The proce- 
dure is done for the radial and the two perpendicular spatial 
directions. All the resulting histograms appear in this fig- 
ure, and are compared to the standard normal law (red thick 
curve). 



As already mentioned, the key point that the clas- 
sical Markovian theories do not consider is the out-of- 



The first hypothesis of the non-Markovian theory is 
that the splitting distribution tt is a multivariate Gaus- 
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sian. This hypothesis is tested on Fig. [3j where we rep- 
resented the superposition of the histograms of all the 
modes in all spatial directions, after rescaling by the 
measured average and variance. If the Gaussian approx- 
imation were accurate, then all the histograms obtained 
in this way would look like a normal distribution, which is 
obviously the case in Fig. [3j For each mode number i and 
each spatial direction a, the marginal probability density 
7r(ai^c() looks like a normal distribution. Although we 
cannot readily deduce that the multivariate distribution 
7r(|a)) is Gaussian, these results indicate however that 
the Gaussian approximation is a very reasonable one. 
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FIG. 4. Blue squares: diagonal terms ajj (2 < j < TV) of 
the covariance matrix of the modes at the instant of first 
cyclization, in the 3 spatial directions. The red line represents 
the diagonal terms of the covar ianc e matrix of equilibrium 
looping conformations given by (19 1. Parameters: A'^ = 300 
and a = 4.33. 



Then, a second hypothesis of the non-Markovian the- 
ory is that the covariance matrix of the splitting distri- 
bution is approximated by the covariance matrix of equi- 
librium looping conformations given by Eq. (19). We 



represented the diagonal elements ajj of this covariance 
matrix obtained in the theory and the simulations on Fig. 
|3j As can be observed, the stationary covariance approx- 
imation is excellent for almost all modes whose number is 
larger than 3-4, but is not fully accurate for small mode 
numbers. We can therefore expect that the theory does 
not predict correctly the actual polymer conformation at 
the length scale associated to the first modes, which is of 
the order of \/N. However, all the other diagonal terms 
ajj are very well described by the stationary covariance 
approximation. We also investigated the values of the 
first non-diagonal terms (tJj_|_2, which are well described 
by their stationary value for small mode numbers. For 
larger values of the mode numbers, the noise of the simu- 
lated conformation (coming from statistical error and the 
finite value of the time step) is too large to even define 
the sign of the correlation coefficients ^'^d to con- 



clude whether the stationary covariance approximation 
is accurate for these correlations. Following these com- 
ments, we deduce that the stationary covariance approx- 
imation seems well supported by the comparison with 
simulations, although it could be supposed to fail to de- 
scribe the polymer conformation at the large length scale 
x/]V. 
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Markovian 



10 100 

mode number j 

FIG. 5. Average value of the modes mj at the instant of 
end-to-end cyclization, for the same parameters as in figure 
|6] Only the modes for j even are shown (the other modes 
vanish). Symbols: simulations. Red line: non-Markovian 
theory. Green thick da she d line: asymptotic relation of the 
non-Markovian theory (37 1, proportional to Magenta 



thin dashed line: Markovian approximation (26 1. The sign of 
the Markovian expression of mj has been inverted. Parame- 
ters: N = 300 and a = 4.33. 

We now focus on the average shape of the polymer at 
the instant of reaction, which is the key quantity calcu- 
lated in the non-Markovian theory. The average polymer 
shape is described by the average spectrum of the reactive 
conformations formed by the ensemble of the values mj, 
which is represented on Fig. [sj As can be observed, the 
predictions of the non-Markovian theory on the struc- 
ture of the average spectrum are qualitatively correct 
for almost all values of the mode number j and there- 
fore all length scales. The non-Markovian theory slightly 
overestimates the values of mj, especially for low values 
of the wave number j. This discrepancy between the- 
ory and simulations possibly comes from our simplifying 
hypotheses of a splitting distribution that is Gaussian 
with the stationary covariance approximation, which is 
not fully accurate for small mode numbers. However, 
the non-Markovian theory is much more precise than the 
Wilemski-Fixman Markovian theory, in which the values 



of are given by their stationary value [Eq. (26)]: the 



value of these modes have the wrong sign and are an un- 
derestimation of up to two orders of magnitudes of the 
actual average value of the modes (Fig. [s]). Coming back 
to the the space of monomer positions instead of modes, 
we can describe the average shape of the polymer by the 
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function (zi)7r = Y^^=2Qii™'^j which represents the av- 
erage value of the spatial position of the monomer in 
the chain in the direction of reaction, and which is repre- 
sented on Fig. [6] Importantly, the average shape of the 
polymer is found to be qualitatively predicted by the non- 
Markovian theory (and not by the Markovian approxima- 
tion) , but this theory overestimates the magnitude of the 
polymer elongation at the instant of reaction. This fail- 
ure of the theory at the large length scale ^/N is eventu- 
ally related to the stationary covariance approximation, 
which fails at these length scales. The elongation of the 
polymer on average, as apparent on the average function 
(zi)^ on Fig. [6j can also be seen directly on the pictures of 
the polymer reactive conformations, as shown in Fig. [T^ 
and[T]D, where we represented conformations issued from 
simulations and from the splitting distribution ttq of the 
non- Markovian theory. In these conformations, the poly- 
mer tends to be much more elongated in the direction 
of the reaction than in the equilibrium looping confor- 
mations shown on Fig. [ij:. The fact that the polymer 
does not have to wait for reaching an equilibrium loop- 
ing conformations so that the two end monomers come 
into contact implies a reaction kinetics that is faster than 
predicted by the Markovian Wilemski-Fixman theory. 



rescaled reaction radius a = a/\/N \s held constant 



simulation 
non-|Vlarl<. 
|\/1arl<ovian 




100 200 
monomer position i 



300 



FIG. 6. Average position l^Zi) of the i monomer at the 
instant of first cyclization in the direction of reaction, as a 
function of the position of the monomer in the chain i. Pa- 
rameters: A'^ = 300 and a — 4.33. Blue symbols: results of 
simulations, averaged over 2440 realizations. Red line: non- 
Markovian theory. Green dashed line: Markovian (Wilemski- 
Fixman) theory. Parameters: A'^ = 300 and a = 4.33. 

We now discuss some of the properties of the average 
spectrum mj of the reactive conformations. As can be 
seen on Fig. [5j the coefficients mj decrease as a power 
law of over about one decade, with an exponent (3 
that is clearly less than 2. From Eq. ([23|, we can de- 
rive an analytical argument (presented in Appendix [d]) 
that provides the asymptotic behavior of mj for large j, 
when the number of monomers tends to infinity while the 



21/6 ^1/3 



4/3 



-N. 



(37) 



The value of nij predicted by this expression agrees rea- 
sonably well with the structure of both the theoretical 
and simulated spectrums shown on Fig. [5] Such a slow 
decrease of the spectrum is transferred to the average 
shape (zi)7r, whose first derivative seems to be infinite at 
i — 1 and i — N on Fig. [6] This means that the first and 
last monomers of the chain are on average very shifted 
from the position of the reactive zone, thereby confirm- 
ing the image of a polymer that forms an elongated loop 
at the instant of the reaction. When the size of the re- 
active region gets smaller (a — )■ 0) , (371 is not valid any- 
more, and is probably replaced by a law mj ^ N/j^/^, 
although our preliminary calculations suggest the exis- 
tence of a logarithmic correction to this law. The differ- 
ence between the slopes 3/2 and 4/3 is however difficult 
to detect in the simulations, where the data are noisy 
and the power-law behavior holds only over a decade. 

In the limit N ^ oo, we can also get the qualitative 
form of the asymptotic behavior of the reaction time. 
We assume that — > oo at fixed value of the rescaled 
reaction radius a — a/\/N. The eigenvalues are well 
approximated by: Aj ~ (j — l)^7r^/Af^, while the coef- 



ficients hj are given by: bj 



-ypv if J 



and 



vanish for odd values of j. The time is rescaled by the 
power of N that corresponds to the Rouse time: we pose 
T = t/N"^. Then, we find that the scafing M, = mJ_^_^/N 
is the correct scaling for the moments of the splitting 
distribution, as it leaves Equation. (23) invariant on 
N. We also introduce the rescaled reactive trajectory: 
Yt,{t) = RyrirN'^) /y/N , and the rescaled mean square 
displacement ^'(r) = Nip{t). Then, the reaction time is 
given in the limit of large N by the relation: 



T 



dT 



1 



(27r4')3/2 



exp 



y2 

^ TT 

2^ 



Zi~a,l) 



(38) 



As the right-hand side term of this equation does not de- 
pend on N, we deduce the scaling relation T ~ N'^F{d) 
for large N. The function F does not diverge for small 
values of its arguments, implying that for a small size of 
the reactive region, the reaction time reads T ~ 7V^F(0). 
This scaling relation is of the same type as the one that 
is deduced from the Markovian Wilemski-Fixman ap- 
proach (30), but the numerical coefficient F{0) is dif- 
ferent. The estimation of this numerical coefficient is 
rather difficult, because solving the equations for a small 
value of a and a value of N which is not large enough 
makes the result fall into the regime where the reac- 
tion time is T ^ N'^^'^/a and diverges with a. Our 
best estimation is -F'(O) — 0.175, whereas the Markovian 
approximation gives F{0) ~ 0.2003: the difference be- 
tween the two numerical coefficients is about 12%. In 
intermediate regimes of a and N, the results of the non- 
Markovian and Markovian theory differ by about 100%: 
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in all cases, the non-Markovian effects, that result from 
the non-equilibrium reactive conformations of the poly- 
mer, give results that are quantitatively different from 
the Markovian approximation. 

Finally, we also consider the case of a small reactive 
radius: a at fixed N. Then, equation (23 1 predicts 
that the moments are asymptotically proportional 
to a in this limit, and that: mj = abi[—l + l/(AiL^)]. 
The fact that the moments are proportional to a indi- 
cates that they are small and do not play any role in 
the reaction time. This can be seen by considering that 
T in this limit is evaluated by taking the integrand of 
(21 1 in the small time limit, where i?^ ~ a(l -I- 0{t)) has 



the same behavior as in the Markovian approximation. 
Therefore, the result of T as a — )■ is exactly the time 
(31) predicted by the Markovian approximation, and it 
is also the same as predicted by the SSS theory. From 
the scaling argument (111, the time T ~ N^^^/a comes 



from the diffusive regime of R at small times, where the 
monomers behave as if they were disconnected in an ef- 
fective confining volume N^/"^. It is striking that, in this 
limit, all the approaches lead to the same result. It is 
likely to be linked to the fact that the determining step 
of the search process in this regime is diffusive, and that 
diffusion is a Markovian process. 
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FIG. 7. Variation of the reaction time with the position of 
the second reactive monomer when the first monomer is at 
position p — 1 (set of curves that vanish sit q — 1) and p = 34 
(curves that vanish at g = 34). Continuous red lines: non- 
Markovian theory ; dashed green lines: Markovian theory. 
Parameters: A'' — 100 and a — 2.01. Inset: same figure after 
rescaling of the reaction time by the inverse of the j-factor 
(27rL2)3/2. 



VII. EFFECT OF THE POSITION OF THE REACTIVE 
MONOMERS ON THE REACTION KINETICS 

We now investigate the importance of the positions of 
the monomers on the reaction kinetics. We remind that 
p and q are the indexes of the two reactive monomers 
in the chain. In Fig. [7] we show an example of how 
the reaction time varies with p and q for fixed values 
of N and a. As expected, the reaction time vanishes 
when the reactive monomers are close {p = q), and the 
reaction time T(p,q) increases in general when |p — 
gets larger. However, in some regimes, the situation 
is more complex and T{p, q) decreases when \p — q\ in- 
creases. This phenomenon occurs in particular when one 
of the monomers gets closer from the chain end (Fig. [7|, 
and possibly comes from the fact that the motion of an 
end-monomer in the subdiffusive regime is faster than 
for an interior monomer, whose motion is slowed down 
by the presence of two surrounding polymer chains. On 
the inset of Fig [Tj we show the reaction time after it 
has been rescaled by the equilibrium contact probabil- 
ity density, which is also called the j— factor and reads: 
]{p,q) = Pstat(R = 0) = l/(2^|p- From this 

plot, it is quite obvious that the j— factor is not an accu- 
rate quantity to describe the dependance of the reaction 
time with the position of the reactants. This is not a 
surprise for diffusion controlled reactions and can be de- 
duced from the expression of the reaction time (21 ). This 



expression shows that the reaction time is inversely pro- 
portional to the J— factor, but that the other terms also 
contain a lot of information about the reaction time. 



In order to illustrate this fact, we give asymptotic for- 
mulas for q) in the Markovian approximation. First, 
in the limit a — )■ at fixed p and 9, we get the asymptotic 
result: 



T: 



13/2 



0), 



(39) 



which is the generalization of (31) in the case of in- 
tramolecular reactions. In this case, the reaction time 
is effectively proportional to the inverse of the equilib- 
rium contact probability density, with a proportionality 
factor that depends as 1/a. In this regime, the reaction 
time comes from the diffusive behavior of the monomers 
at short times, it does not depend on the precise location 
of the monomers in the chain, but only on the number of 
monomers |p — that separate them. When the number 
of monomers between the reactive groups grows to infin- 
ity however, the regime (39) disappears as the reaction 
time is controlled by the subdiffusive regime. A simple 
scaling argument enables to derive the scaling law for the 
reaction time in this regime. Considering that the vec- 
tor R explores a volume of size L ^ \p — q\^l'^ with a 
subdiffusive walk of dimension — 4, we get that the 
reaction time scales as T ~ L''™ = \p — q\^ ■ Actually, 
as shown below, this asymptotic form corresponds to the 
predictions of the Markovian theory, but with a numeri- 
cal coefficient that depends on the position of the reactive 
monomers in the chain. We show in the appendix [E| that 
the results of the Markovian theory must be discussed 
with the positions of the monomers. If one of the reac- 
tive monomers is an end-monomer (meaning that it is 
separated from the first monomer by a finite number of 
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monomers as N ^ oo), then the reaction time reads in 
the regime \p — q\ ^ N: 



T^c\p-q{- 



0.38 



(40) 



Note that the coefhcient c has an analytical form that is 
given in the appendix [Ej If the two reactive monomers 
are interior monomers, then the reaction time depends 
on their average position on the chain s* defined by s* = 
(p + l)/{'2.N). We obtain the analytical formula: 



3 iV 

47r \p - q\ 



B(s* 



(41) 



where B{s*) is a numerical function that can be deter- 
mined explicitly (appendix |e| and that is represented on 
Fig. [8| This funct ion describes how the reaction kinet- 
ics between two interior monomers is slowed down when 
the reactive monomers are located deeper in the chain. 
Interestingly, ( [4T| ) indicates that there is a logarithmic 
correction to the scaling T ^ |p — gp. Finally, we note 
that, for s* — > 0, B{s*) diverges as: 




FIG. 8. The function B{s*) appearing in the expression (411 
of the reaction time for two close interior monomers, and its 



expansion for small values of s*, given by (421 



B{s*) 



ln(2s*) + Ko (s* ; Ko ~ 0.74). (42) 



This means that the reaction time between two interior 
reactive groups that are located very close to the polymer 
extremity is approximately given by: 



(43) 



We note that kq ~ 2c, where c is the coefficient c appear- 
ing in the expression (40). This indicates that, even if 



p <^ N, the fact that the reactive monomers are inside the 
chain makes the reaction about twice slower than if the 
reactive monomers were separated by the same number 
of monomers, but with one of the reactive groups at an 
end-monomer. The expressions ( 40 ) , ( 41 ) , ( 43 1 are asymp- 
totically exact under the Markovian Wilemski-Fixman 
approximation, and the exact values of c, and B(s*) 
are given in the appendix [Ej We are not aware of any pre- 
vious work where these expressions are derived, but we 
note that they are similar to those that are found with the 
renormalization group theorjOT. Even if non-Markovian 
corrections are to be expected, these formulas illustrate 
the fact that the reaction time in the diffusion controlled 
regime is not well described by the j-factor, as it varies 
with the position of the reactants in the chain and as the 
scahng of T is \p ~ q\^ instead of \p — q\^^^. 

Finally, we also show a typical shape of a reactive con- 
formation when the reactive monomers are at the interior 
of the chain on Fig. [9j We observe that the function (zi)7r 
has singularities around the positions i = p and i = q, 
and that the average reactive shape of the region of the 
polymer between the two reactive monomers is similar 
to the reactive conformations of a cyclizing polymer ap- 
pearing on Fig. [6| Interestingly, the average shape of 
the parts at the exterior of the chain are also affected by 
the reaction: the whole polymer is therefore much more 



elongated in the direction of the reaction than in an equi- 
librium looping conformation, and this elongation is not 
limited to the part of the polymer between the reactive 
monomers. 
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FIG. 9. Example of average positions of the monomers at 
the instant of the reaction in the case of an intramolecular 
reaction. There are A'^ = 501 monomers, and the reactive 
monomers are in positions p = 250 and q = 376. The reactive 
radius is a = 1.46. 
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VIII. CONCLUSION 

In conclusion, we have presented in this paper a non- 
Markovian theory that describes the kinetics of diffusion 
controUed intramolecular polymer reactions. This the- 
ory highlights the key role that is played by the confor- 
mational statistics of the polymer at the very instant of 
the reaction, which is not considered by classical theories 
of cyclization. For large chains, the reactive conforma- 
tions are elongated in the direction of reaction and are 
very different from equilibrium looping conformations. 
The average reactive conformations are characterized by 
a spectrum that decreases slowly with the wave number: 
the average reactive conformations present irregularities 
around the positions of the reactants in the chain, mean- 
ing that all the monomers that are next to the reactive 
monomers in the chain are at the instant of reaction very 
shifted in space from the position they would have in an 
equilibrium looping configuration. In the case of end-to- 
end cyclization, the polymer forms an elongated loop at 
the instant of reaction. In the case of intramolecular re- 
actions, the whole polymer is elongated, not only the part 
of the chain that lies between the reactants. The fact that 
the polymer does not need to wait to reach an equilib- 
rium conformation where the reactants are brought into 
contact implies a reaction kinetics that is faster than the 
Wilemski-Fixman Markovian theory, which assumes that 
the reactive conformations are equilibrium looping ones. 

We have also reviewed the hypotheses of the existing 
theories of cyclization in the context of our more general 
non-Markovian theory. The SSS theory, which approxi- 
mates the whole chain by a single spring, is exact for very 
small reactive radius but becomes rapidly inaccurate as 
N becomes large, as it does not predict the correct scal- 
ing law T ~ 7V^ for large N. The Wilemski-Fixman the- 
ory implicitly assumes that the reactive conformations 
are equilibrium looping conformations and relies there- 
fore on a Markovian assumption. It predicts the two 
regimes and N^^^/a for the reaction time, but over- 
estimates it quantitatively for large values of N. We 
have given a simple scaling argument that allows to un- 
derstand the origin of these two regimes as the result of 
two-substcps in the reaction, which involve the diffusive 
and subdiffusive behavior of the reactants motion that 
appear at different time scales. Interestingly, the result 
of the renormalisation group theory can be recovered by 
taking the limit d — > 4 of the Wilemski-Fixman expres- 
sion of the reaction time for large N, suggesting that the 
renormalisation group approach also relies on a Marko- 
vian approximation. As a matter of fact, the difference 
between the result of the renormalization group and the 
Welemski-Fixman theory in the limit of large N is only 
2.4%. 

The non-Markovian theory presented in this paper pre- 
dicts the same scaling laws as the Markovian theory, 
but with a different numerical coefficient in the large N 
regime, and is found to be in very good agreement with 
simulations for all the values of parameters that we tried. 



This demonstrates that the fact of taking into account 
the reactive conformations is the essential missing ingre- 
dient of existing Markovian theories. The non-Markovian 
theory still makes approximations, as it assumes the 
Gaussianity of the distribution of reactive conformations 
with a stationary covariance matrix. These hypotheses 
are well supported by the comparison with simulations: 
even if the non-Markovian theory leads to an overestima- 
tion of the average value of the modes at the instant of 
reaction, it catches the correct structure of the spectrum 
of the reactive conformations at all length scales. The 
fact that the Markovian and the non-Markovian theory 
predict the same regimes for the reaction time is not ob- 
vious. Actually, it is not always the case: in the case 
of intermolecular reactions in a ID space, the Markovian 
approximation has recently been shown to lead to incor- 
rect scaling laws that can overestimate the true reaction 
time by several orders of magnitude^^. In the different 
context of the study of other stochastic processes such as 
Fractional Brownian Motion, it has also been shown that 
a Markovian approach can give wrong scaling exponents 
that characterize the first passage time density'^- . 

Finally, we also gave formulas that show how the re- 
action time varies with the positions of the reactants in 
the chain in the Markovian approximation. These for- 
mulas highlight the fact that the reaction time cannot 
be deduced by considering only the contact probability 
density of the two reactants. In the regime of a contin- 
uous chain, for a fixed distance between the reactants, 
the reaction time depends on the location of the reac- 
tants in the chain, and is slowed down when the reactive 
monomers are in the interior of the chain rather than in 
the exterior. 

In this paper, we only considered the simple Rouse 
chain model of polymer, for which the reaction kinetics 
is already non-trivial to determine. In future works, we 
aim at estimating the non-Markovian effects appearing 
in reactions involving complex polymers or general non- 
Markovian processes. 
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Appendix A: Some useful properties of Gaussian processes: 
projection and propagation formulas 

We remind here some properties related to the Gaus- 
sian processes. First, we remind formulas for condi- 
tional probabilities. Let us consider a Gaussian distri- 
bution of N variables (ai,...ajv), with covariance ma- 
trix aij and mean vector m^. Consider also the variable 

X = ^i'^i ~ (^l'^)j which is a linear combination of 
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the original variables a^. Then, the probability density 
distribution of Oi, given that X takes a particular value 
Xq, is a Gaussian. We note mf its average and a*^ the 
covariance matrix of the conditional distribution, given 
by: 



\a\b) 



{{b\m)~X), 



{b\a\h) 
{eMb){eMb) 



(Al) 
(A2) 



where je^) represents the basis vector whose all compo- 
nents vanish except the one in z**^ position, whose value 



is 1. (All and (A2) are called projection formulas. As 



explained in details in a previous work'^^, these formulas 
can be derived from the formulas on conditional distri- 
butions presented in the book of EatoiP^l. 

Then, we also remind how a Gaussian distribution is 
propagated under one dimensional version of the Fokker- 
Planck equation Q, which reads: 



dP 

1h 



N 



d 



(A3) 



Let us assume that a,i t = 0, the distribution 
P{ai, qn ,t = 0) is a Gaussian, with mean vector 
rrii and covariance matrix ct;,. Then, if P satisfies the 



Fokker-Planck equation ( A3 ) , it remains Gaussian at all 



later times t > with a mean vector and a covari- 
ance matrix jij (t) which reacP^t 



-Ait 



A. 



■(1 



(A4) 
(A5) 



The two formulas ( A4 ) and ( A5 1 are called propagation 
formulas. 



Appendix B: The functions (f>{t) aid 

Here, we describe how to derive the effective propa- 
gator P(R, i|{Ro, stat}). Because all coordinates are in- 
dependent, we consider the same problem for the first 
coordinate X of R. Then, the modes can be considered 
as scalars, and our aim is to calculate P(X, t|{Xo, stat}) 
the probability to observe X a,t t given that initially the 
polymer is at equilibrium with an initial position Xq. 
Using the projection formula (All, we find that the av- 



erage TO 
X = Xq writes: 



stat.Xo mode a, 



at equilibrium given that 



TO ■ = Ao 



(Bl) 



Then, using the propagation formula ( A4 1 , we can com- 



pute the average of X at t, for the same initial conditions: 
E(X,t|{Xo,stat},0) 



N 

^6,TOf"''^«e-^'* 

i=2 



^0 



N 

i=2 * 



Xo(j){t). (B2) 



From this equation, we easily deduce the value ([8]) of the 
function (j){t) given in the main text. 

The variance is computed the same way. We call trfj'** 
the covariance matrix of the modes g^, a, - at equilibrium. 
According to the projection formula ( A2 1 , the covariance 



of the modes a^, aj at equilibrium with the condition that 
X ^Xo is: 



{stat,Xo} 



'\b){ej\a'^''^\b) 



(B3) 



Using the equilibrium value af 



5ij/Xi, we find: 



{stat,Xo} 



A, 



(B4) 



We now define 7, 



{stat.Xo} 



the covariance of the modes i,j 



at t, when the polymer is initially at equilibrium with the 



condition X = Xq. Using the propagation formula (A5|, 
we get: 

^{stat,Jfo} ^ ^{stat,Xo}g-(Ai + Aj)t g-2Ajt-j 



Xi Xj 



% (B5) 



{stat,*} 

{stat.Xo} ^YgQ i^g noted 



The last line states that 7 
^{stat,*}^ as it does not depend on Xq. Summing the ex- 



pression (B5| over i,j (after multiplication by bibj) gives 



the expression of il'it), which is the variance of X at t, 
for equilibrium initial conditions with X = Xq'- 



N 



■ij=2 
N ,2 

1=2 * 



N 

E 

i,i=2 



XiXjl? ■ 



(B6) 



Using the expression ([7| for L, and comparing with the 
value (B2| of (t>{t), we find that: 



i,{t)^L^[^~m\ (B7) 

This equation is the value (|8| of V' given in the main text. 
Coming back to the 3D case, because R is a Gaussian 
process, it is totally defined by its variance and average, 
and the effective propagator is: 



P(R,t|{stat,Ro},0) = 



(27rV')3/- 



exp 



(R-</.Ro)^ 



(B8) 
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We also give the behavior of 4> in the hmit iV — oo. We 
pose T = t/N'^ and $(t) = cl){t/N^). We also introduce 
Sp = (p - 1/2) /N and = (q - 1/2) /N, which are the 
coordinate of the reactive monomers in the chain. Then, 
the rescaled function $ reads: 



2N ^ e-^ 
$(r) = 2^[cos(sp7rj) - cos(sg7rj)]^ 



(B9) 



Note that L'^ = \p — q\ = N\sq — Sq\ If r — > 0, the series 
can be transformed into an integral: we pose y — jy^ 
and we obtain: 



1 - $(t) 



2Na 



dy 1 



It y'^'n'^T ^ 



2Na^ 



(BIO) 



where a is the average value (over j) of the slowly vary- 
ing term [cos(sp7rg) — cos(Sq7rj)]^. In the case of two 
monomers at the interior of the chain (0 < Sp < 1 and 
< Sg < 1), we have a = 1. In the case of the cycliza- 
tion, where the two reactive monomers are at the chain 
extremities, we obtain a = 2. In the case of the reac- 
tion between an end-monomer and an interior-monomer, 
a = 3/2. Finally, the anomalous behavior (BIO) of $ is 



transferred to the function ^/j, which is equal to: 



il){t) ~ K^/t ; K = Aa/^ ; K t < iV^ 



(Bll) 



and that defines both T and ttq: 

rP,tat(R/)-Pstat(|a)|R/) = 

j dt J dn[P{Rf,t\Trn,0)P{\a},t\Rf,t;nn,0) 

- P(R/, <|{stat, i?oUr}, 0)P(|a), t|R/, t; {stat, RoUr}, 0)]. 

(CI) 



Integrating (CI) over all conformations, and noting that 



the conditional distributions P(|a), i|R/, ...) are nor- 
malized, we get the following expression for the reaction 
time: 

rPstat(R/) = 

J dt J df7[P(R/,i|7rn,0) -P(R/,t|{stat,PoUr},0)]. 

(C2) 

We can assume without loss of generality that Rj — 
RfUz, with Uz the unit vector pointing in a fixed (ar- 
bitrary) direction. Self-consistent equations for the mo- 
ments will be derived by multiplying ( CI ) by a^^ and 



integrating over the conformations. Some intermediate 
calculations need to be done. First, we calculate the fol- 
lowing integral, which is interpreted as the expression of 
an average quantity over a conditional distribution, and 
is readily deduced from the projection formula ( Al ): 



d|a)ai^Pstat(|a)|P/U2) 



{stat, 7?/ } 



Rfh 



(C3) 



Appendix C: Derivation of the self-consistent equations for 
the moments ml in the non-Markovian theory. 



In this section, we derive the equation ( 23 ) in the main 
text. We remind that il represents the direction of the 
vector R at the instant of reaction, and that 7ro(|a)) is 
the distribution of variables at the instant of reaction, 
given that the direction of R is fl. We note the unit 
vector pointing in the direction 57, while 9 and if refer 
to the polar and azimuthal angles. The key hypothesis 
of the non-Markovian theory is that ttq is a multivari- 
ate Gaussian. For symmetry reasons, the average vector 
Sii at the instant of reaction points in the direction f2 
and is given by m^Ur(O). We also make the simplifying 
assumption that the covariance matrix of the splitting 
distribution is given by the stationary covariance matrix 
(conditioned to a particular value of R, so that it is given 
by Eq. ( B4 ) , and the only unknown variables of the the- 



ory are the average moments mj and the reaction time 
T. In this appendix, we first derive the equations in the 
case that the initial distance between the reactants is 
fixed to a single value Rq, and at the end we will show 
how to adapt the calculation to a distribution of initial 
distances. We start from an integral equation that is de- 
duced from (15) after multiplication by 6{{b\a) — R/), 



Then, using (A4), we note that the average value of at 



at t in the radial direction u^., given that the initial dis- 
tribution is the splitting probability 7rn(|a)), is given by: 
IJ,i{t) = e~^'*. Because we assume that the covariance 
matrix of ttq is the stationary covariance matrix, the co- 
variance of tti, aj at t is equal to 7j^J^''*. Hence, using the 



projection formula (Al), we can calculate the following 



integral (at fixed value of the angle 



d\a)airP{\ai),t\RfUz, t; tt^, 0) 



{{b\n) - Rf COS 9), (C4) 



where we used the fact that the projection ol RfU^ over 
the direction is P/ cos6'Ur. We introduce the reactive 
trajectory P^(t) in the direction of the reaction: 



N 



N 



'iTJiie 



(C5) 



Using ( B5 ) , we get the simplification: 



(e.M-^-^-*}|6)^ ^'(^-f" (C6) 
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and we also introduce fi^' , which is interpeted as the 
average value of the mode in the radial direction, con- 
ditioned to the fact that R = 0: 



From (1C5|,(1C6|),(1C7| we find that (1C4|) can be rewritten 
as: 



/ 



d\a)a„P{\a),t\RfU^,t;7Tn,0) = 



Rf cos 9. (C8) 



With the same method, we calculate the following inte- 
gral: 



J d\a)a,eP{\a),t\RfU^,t;TTn,0) = 



Rf sine. (C9) 



Now, noting that aiz = airCOsO — ajgsin^ and using the 
two evaluations (C8),(C9), we get: 



d\a.)a.,zP{\a),t\RfUz,t; tt^, 0) 



cos 9mfe 



Xiip 



{cos 9 Rrr - Rf). 



(CIO) 



With the same reasoning, we get that: 

J d\a)a,zP{\a),t\RfUz,t;{RoUr,sta.t},0) = 
cosb* . ^^ e ■ ^ — {cos9Ro(p — Rf). 



(Cll) 



Using the relation (B7), we simplify this result: 



d\a)a,zP{\a),t\RfUz,t; {i?oUr, stat}, 0) = 
cos^^(e--^-0) + -^^(^-^^"') 



A.V 



Rf. 

(C12) 



Now, the propagator P{RfUz,t\Trn,0) is given by: 
P{RfUz,t\7Tii,0) ^ ,^ ■T./^ exp-'- 



[27rV']3/2 



(C13) 



We expand this expression at first oder in i?j: 
P{RfU,,t\7rn,0)^ ( 1 + cos + 0(i?^ 



(C14) 



We also get the small Rf expression of the second prop- 
agator P(i?/U2, i|{stat, i?oU,.}, 0): 



P{RfUz,t\{stat, RoUr},0) ~ 



cos6i + 0(i?; )j . 

(C15) 



Using equations (C3),(C10|,(C12),(C14),(C15|, we mul 



tiply ([Cl| by a^^ , integrate it over the conformations and 
develop the result in powers of i?/. The term propor- 
tional to (i?/)*^ will vanish after integration over the angle 
9, so that we write the term proportional to Rf, which 
reads: 



TP.tat(0) 



A.£2 
°° , r d9sme 

JQ 



e 



—fi^' (cos 61) -f 



27rV']3/2 

6,(1 -(/)e-^^*) 



— e 



2^ 



A.V2 ^ 



A.V 



0)(cos6')2 



6,(1 



A,;-0 



(C16) 



Finally, performing the integration over 9, and taking 
account the relation (C2) (written for Rf = 0), we find 
the simplified form of the self-consistent equations that 
define the non-Markovian theory: 



dt 

W^' 



lil°R^ , 6,0(0-6-^'*) 



A,: 



cxp 



0, 

(C17) 



which is the form of the self-consistent equations that 
define the first moments of ttq in the case that the initial 
distance Rq is fixed. The case where Rq is averaged over 
the distribution RQe~^o/i^^ ) /Z is treated exactly in the 
same way, but by keeping the average over Rq at each 
step of the calculation. Then, we obtain the equation: 

0= r^UMl:!^ + M(i^e:^\ ^-H 



^'/^ 



Ai 



R^e-^ b,<hU-e-^^*) R?^ -^"^ 



(C18) 



Taking into account the definition ( 24 ) of G, this equation 



is exactly (23 1 in the main text. 
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Appendix D: Asymptotic value of the moments m" 



Appendix E: Markovian expression of the effect of the 
distance between the reactive monomers 



Here, we describe how to derive the asymptotic form 
( 37 1 of the moments m^, when — > oo and j ^ oo while 



the value of a = a/vN is held constant. We only study 
the case of end-to-end cyclization, with p = 1 and q = N. 



It is straightforward to see that the equation (23) is in- 
variant with N if we use the rescaled variables r = t/N'^, 
Y^ir) = VNR^{t), Xj = (j - l)27r2, 6, = -^/W/N (if 
j odd), ^'(r) N%l;{t) and ^(r) = The correct 

scaling for the moments is Mj — mJ^i/N (we note that, 
trivially, Mj = for j even). With these rescaled vari- 
ables, the equation (23) is independent on A^. It can be 



developed in powers of q. The terms that do not con- 
tain exp(— j'^r) generate algebraic terms in j, and they 
must vanish, yielding a global condition on the unknown 
function Y^^. The remaining terms give the equation: 



1. Case of two reactive monomers in the interior 

Here, we determine the asymptotic value of the reac- 
tion time when the reactive monomers are in positions p 
and q in the chain, with |p — 9I ^ N. We work within 
the Markovian approximation. We introduce Sp — p/N 
and Sq = q/N the relative positions of the monomers in 
the chain, and A = |sp — Sq|, which represents the differ- 
ence of curvilinear coordinate between the two reactive 
monomers, and s* — {p + q)/{2N), which is the average 
curvilinear coordinate of the two reactive monomers. In 
the limit N ^ 00, we can write (j)(t) = $(t), where r 
is the rescaled time (r = t/N'^). The function $(r) is 
deduced from from the definition ([8| and from the value 
^ of the coefficients bi in the limit of large N: 



= 




3 



3 

Z{h, 1) ^ 3*Z(a, 1) 



+ 1 — ■ 
G(«,*) 



(Dl) 



All the terms in this equation can be evaluated in 
their short time limit. We introduce the simplifications: 
^{t) ~ kt, $ ~ 1, Y^ ~ a, Z(a,^') ~ a^'e-"'/^^*), and 
G = (a, *) ~ a^^e"^''/^^*). Keeping only the dominant 
terms for t — >■ 0, we get: 



<i>(r) 



-JY 1^ icos(sp7rj) - cos(sq7rj)J 

A — 1 



(El) 



Using the fact that = AA^ and elementary trigonom- 
etry, we find: 



A 



sm I 1 sm(7rs j) 



T 9 -2 2 



(E2) 



From Equation (30), we get that the reaction time (av- 



eraged over equilibrium initial conditions) in the limit of 
small target size (a ^ /S.^/N) scales as: 



P 



MjO, 



Sjq'^n'^Ky^ 



Inverting this relation leads to: 



(D2) 



M. - (^] /o°°-^-'^^^-'^'^"'^^ (D3) 



T 



dr \ , 
l[l 



1 



$(r)2]3/2 



1 



(E3) 



The right-hand side of this expression does not depend 
on A'', but only on the parameters A and s*, and we are 
looking for the limit A — » at fixed s* . Determining the 
limit of this expression ( E3 ) in the limit A requires 



to know the behavior of $ in the same limit. Developing 
expression ( E2 1 in the limit of small A at fixed r leads 
to: 



where we have posed H{t) = tt^t -|- /{j'^2Ky/T). The 
integrals in (D3) can be evaluated with the saddle point 



method. The position of the saddle point is found by 
solving drH{T*) = 0, which gives r* = [a^/(4Kj^7r^)]^/^. 
Hence, we find that: 



21/65I/3 



4/3 



(D4) 



$(r) ~ G(r,s*)A ; G(r,s*) = 2 ^ sin(^s*jfe-^''"' 

3=1 



(E4) 



Since by definition $(0) = 1, expression (E4) cannot be 



valid for small values of t. It is in fact valid as long as 
T ^ A^. At smaller time scales, we write t = uA^, and 
we investigate the limit A ^ of (E2) at fixed value of 
u. We find: 



wher e th e last equality uses the value ( Bll ) of n. Equa- 
tion ( D4 ) is exactly the asymptotic form p?]) in the main 
text. 



$(uA2 



Fiu) 



dx sin 



(E5) 
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The two expressions ffi4l,(E5) of $ at different time 



scales can be matched by noting that, when w — > oo and 
r — > 0, we have: 



F(u) 



1 



A 



A G{t). 



(E6) 



Let us introduce a parameter e that satisfies the condition 
A^ ^ £ <C 1 and is therefore in the intermediate time 



scale. Then, separating the integral (E3 1 into two pieces, 



and changing of variable in the first one, we get: 
T . o /"^ , / 1 

iV2 



1 



3A2 



*M2 



dr[G(r,s*)] 



(E7) 



Because of the matching condition ( E6 1 , this expression 



does not depend on e, and we obtain the following asymp- 
totic expression for the reaction time: 



A-K A ^ 



(E8) 



where B[s*) is a numerical function of the average posi- 
tion of the monomers s* and is defined as B = Bi + B2, 
with: 



lim 



du 



1 



3 1 
B2 = lim ■! — In - 



[1 -F(u)2]3/2 




(E9) 



The function B(s*) is represented in the main text on 
Fig. [Sj We find numerically that, for s* — >■ 0, we have: 



B{s*) 



An 2s* 



kq, 



(ElO) 



with kq ~ 0.74. Noting that, when s* — )■ 0, s* is ap- 
proximately given by A/2, we deduce that the reaction 
time for two close reactive monomers that are in the in- 
terior of the chain but close to the chain extremity is 



2. Case where one of the reactive monomers is at a chain 
extremity 

We now consider the case where the first of the two 
reactive monomers is located at one chain extremity. In 
this case, we have Sp = 0, and the expression (E2) of 
$(t) becomes: 



8 

A2 



. Anj \ e 
sm 



2 / fn^ 



(Ell] 



When A at fixed r, we have: 



$(r) ^ A^Goir) ; Go(r) = ^ /e-^'^^^^ (E12) 



At the scale r = wA^, we get: 

^fwA^) = Fo(u) = 8 / dx (sin 

A^O ./n V 



X7r\ * e 



2 / X^TT^ 



(E13) 



We note that ^0(1*) ^ l/u'^/^ when m -> 00, while 
Go(t) - 1/t^ / 2 whe n t ^ 0. Then, using the approxi- 
mations (E12),(ET3]) of $ at large and small time scales, 
we find that in the limit A — !■ the reaction time is given 
by: 



^H[l-^^oH2]3/2 ^ 



3A4 



drGoirf. 
(E14) 



We note that Fq {u) ^ Xj-u^l"^ when u — 00, the that the 



first integral in (E14) converges to a finite value in the 
limit ej A^ — >■ 00. The second integral is divergent in the 
limit e — >■ because Go{t) ~ l/r'^/^ for small r ; however 
it remains of order A'^/e'^, which is small compared to 
one, and can therefore be neglected. Finally, we obtain: 



T ~ N^A-" 



du 



1 



[1 - Fo{ur]y^ 



(E15) 



Numerically, we find T = cN'^A^, with c ~ 0.38. The 
fact that c < indicates that the reaction between two 
reactive groups in the interior of the chain is slower than 
a reaction involving one monomer at the exterior of the 
chain. 
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